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Abstract. Matrix pencils, or pairs of matrices, may be used in a variety of 
applications. In particular, a pair of matrices (E, A) may be interpreted as the 
differential equation Ex' + Ax = 0. Such an equation is invariant by changes 
of variables, or linear combination of the equations. This change of variables 
or equations is associated to a group action. The invariants corresponding to 
this group action are well known, namely the Kronecker indices and divisors. 
Similarly, for another group action corresponding to the weak equivalence, a 
complete set of invariants is also known, among others the strangeness. 

We show how to define those invariants in a directly invariant fashion, i.e. 
without using a basis or an extra Euclidean structure. To this end, we will 
define a reduction process which produces a new system out of the original 
one. The various invariants may then be defined from operators related to 
the repeated application of the reduction process. We then show the relation 
between the invariants and the reduced subspacc dimensions, and the relation 
with the regular pencil condition. This is all done using invariant tools only. 

Making special choices of basis then allows to construct the Kronecker 
canonical form. In a related manner, we construct the strangeness canonical 
form associated to weak equivalence. 



1.1. Equivalence. The primary study of this paper is that of pairs of matrices, 
also called matrix pencils. In other words, we study pairs of operators (E, A) both 
acting from a finite dimensional vector spaces M to a finite dimensional vector 
space V. 

A typical example we have in mind is the linear differential equation 



Such a model is clearly invariant by changes of variable, or by changing the order 
of the equations. More precisely, it is invariant by simultaneous equivalence trans- 
formation of the operators E and A. The corresponding equivalence relation is the 
following: two pairs of operators (E l5 Ai) and (E 2 , A 2 ) will be considered equivalent 
if there exists invertible operators P and Q, operating on M and V respectively, 
such that 



This equivalence relation is associated to a group, which is simply GL{M) x GL(V). 
This is called strong equivalence in [1]. We are interested in properties which are 
invariant with respect to that group action on the matrix pencil. In other words, 
we are interested in quantities that label the orbit of the group action. 
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E 2 = PEiQ, 
A 2 = PAiQ. 
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In fact, a complete set of invariants and a canonical form have been known 
since the works of [16] and [6]. Modern versions of those proofs may be found 
in [1, § XII. 4] and in [2, § A. 7]. The primary tool for obtaining those invariants 
is the Jordan canonical form. For that reason, those proofs are impossible to 
extend to nearby cases, for example to the infinite dimensional case, or to the 
parameter dependent case, not to mention the numerical difficulties associated with 
the computation of the Jordan canonical form. 

As a result, alternative proof techniques were developed, most notably in [13] 
and [17]. Those authors observed indeed that using a Jordan canonical form is not 
suitable to compute the invariants other than the Jordan invariants, i.e., the Kro- 
necker indices and the "infinite elementary divisors" [13]. The idea is to transform 
the pair of matrices into a form which exhibits all the invariants but is not a canon- 
ical form. Those forms are known under the names of generalized Schur-staircase 
form, or GUPTRI (Generalized Upper Triangular Form). We refer to [4, §4.1] and 
[8] for more references on those algorithms. 

Our approach is similar, although with a shift of focus towards the underlying al- 
gebraic structures as opposed to the algorithmic aspects. In particular, we attempt 
to define the invariants from the dimensions of subspaces which are themselves in- 
variants with respect to the equivalence relation at hand. The advantage of our 
approach is that a great deal of results are automatically independent of the choice 
of a basis, or any other structure (like a Euclidean structure). 

1.2. Invariants. A matrix pencil, when considered as a differential equation (1), 
may be decomposed in an intrinsic ordinary differential equation, and an extra 
structure. We will call the invariants of the underlying ordinary differential equa- 
tion the dynamical invariants, and we will call the remaining invariants the non- 
dynamical invariants. In the parlance of the Kronecker decomposition theorem as 
presented in [1], the dynamical invariants would be the finite elementary divisors 
(essentially a Jordan form), whereas the non-dynamical invariants would be the 
infinite elementary divisors along with the row and column minimal indices. 

The dynamical invariants, i.e., the invariants of the intrinsic differential equations 
boil down to the Jordan invariant associated to similarity transformations, and are 
therefore of less interest to us. We will thus mostly focus on the non-dynamical 
invariants, which appear only when E is not invertiblc. Those invariants are well- 
known in control theory, and in the study of differential algebraic equations. In 
control theory, such invariants are the controllability and observability indices ([5, 
§ 6.3]), for differential algebraic equations (DAE), in the case of a regular pencil (see 
subsection 3.8), the most used non-dynamical invariant is the index ([3, VII. 1]). 

Our goal is to define the non-dynamical invariants in a invariant manner, without 
any other structure than the linear algebraic structure. 

1.3. Reduction. The crucial tool to the study of the invariants of a pencil is the 
concept of reduction, which we define precisely in Section 2. 

This concept was gradually developed, under various names, or no name at all, 
first in [18] for the study of regular pencils, then in [17, §4] and [13] to prove the 
Kronecker decomposition theorem. It is also related to the geometric reduction of 
nonlinear implicit differential equations as described in [10] or [9]. In the linear 
case, those coincide with the observation reduction, as shown in [14]. It is also 
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equivalent to the algorithm of prolongation of ordinary differential equation in the 
formal theory of differential equations, as shown in [12]. 

The reduction procedure is an operation that, out of a pair of operators (E, A), 
creates a new, smaller one (E',A'). "Smaller" is in the sense that the reduced 
operators E' and A' are restrictions of E and A on subspaces of M and V, defined 
by V := EM and M' := A^V '. 

This process of reduction is iterated, producing systems (E^ k \A^) and sub- 
spaces MW and yW. This process ultimately stops, and we will call the number 
of steps before it stops the index. When the process stops, the system which 
is produced, denoted by (E^°°\ A^ 00 )), is such that E^°°' is surjective. After run- 
ning the reduction algorithm once more on the dual of that reduced system, i.e., 
on (E(°°)*, A(°°)*), one obtains an isolated system (E^M 00 ), A(°°>(°°)) such that 
E(oo)*(oo) j g now i nvcr tible. 

At each step of the reduction, some information from the original system is lost. 
That information is encoded by integers called "defects" . Those defects are of three 
kinds: a, j3 + and /3~ . The defect a\ is defined as the dimension of the kernel of E, 
regarded as a quotient operator from M/M' to V /V" . The defect /?+ is defined as 
the dimension of the cokcrnel of A, regarded as a quotient operator from M/M' to 
V/V. The iterated reduction then generates the sequences of defects otk and (3^. 
The defects (3~ are defined as the /3+ defects of the system (e(°°)*, A* 00 )*). 

Using those subspaces, defined in an invariant manner, we are able to show the 
following facts: 

• the operator E is invertible if and only if all the defects vanish 

• the pair (E, A) is a regular pencil if and only if the f3 + and j3~ defects vanish 

• we show that the invariants defined in [7], like the strangeness, may also 
be defined directly in an invariant manner, i.e., without using any extra 
structure or basis 

We also show that the defects and the system (E^M 00 ), /\(°°)*(°°)) completely 
characterize the equivalence class corresponding to the equivalence relation (2). 

• the defects are related to the Kronecker indices 

• the invariants defined in [7] may be used to construct a corresponding 
canonical form for weak equivalence: this connects the approaches of [17] 
and [7] 

• using the relation with the Kronecker decomposition theorem, we show 
that the defects of the dual system (E*,A*) are related to those of (E,A) 
by switching the /3 + and /?~ defects. 

1.4. Outline. The layout of the paper is as follows. 

In the first part, Section 2 and Section 3, we show how to derive the non- 
dynamical invariants. In Section 2 we define the reduction procedure. In Section 3 
we define the defects of a system, and study their properties. In particular, we give 
an original proof of the relation between the property of a pencil to be regular, and 
the presence of some of the defects. 

In the second part, we show that the invariants obtain in the first part, namely 
the defects, supplemented by a Jordan structure, are the only invariants of the pair 
of matrices with respect to equivalence. Most of the results in this part are already 
in [17] and [13]. In Section 4 we prove the basic lemmas needed to construct 
canonical forms. In Section 5, we show how to use those tools to construct a 
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canonical form with respect to weak equivalence In Section 6 and Section 7 we 
show that the defects determine a complete canonical form. In Section 8 we study 
the relation with the existing Kronecker canonical form. 

2. System Reduction 

2.1. Setting. 

Definition 2.1. We will call a pair of linear operators (E, A) a linear system, or 
simply a system, if E and A have the same domain and codomain, both of finite 
dimension. 

Given a system (E, A), we will denote the common domain of E and A by 
-^(E,A) an d the common codomain of E and A by V( ElA ), so a system (E, A) may be 
represented as 

E, A : M (EiA) — > V (E ,A). 

2.2. Reduced spaces. The idea behind the reduction of a linear system (E, A) is 
to "disentangle" the spaces associated with the operators E and A. The strategy 
pursued is to try and make the operator E surjective, by successive reduction steps. 
In order to achieve this, we have to describe the lack of surjectivity of E, first 
independently of A, which leads to the definition of the subspace 

V' := EM. 

The next step is now to describe the lack of surjectivity of E, with respect to A, 
which we measure using the subspace 

M' := A"V. 

Remark 2.2. Those definitions make sense when considering the differential equa- 
tion 

Ex' + Ax = 0. 

Notice that any suitable initial condition for this equation must be in M' . If the 
initial condition is not in M', there cannot be any solution stemming from that 
initial condition. 

Let us put those definitions together: 

Definition 2.3. Given a linear system (E, A) we define its reduced codomain 

V(' E A ) Reduced codomain as 

V(e,a) ■= EM (EiA) , 
and its reduced domain ilf| E ^ as 

M (E,A) ~ A^e.a) = {* e M (EiA) : Ax e V ( ' E>A) }. 

Remark 2.4. We will often drop the dependency on the system (E, A), and simply 
write M, M' , V and V when the context is clear enough. 

Remark 2.5. As explained in [14, §5.1], the reduction of Definition 2.3 corresponds 
to the non-linear reduction of general systems of differential equations with con- 
straints. The study of differential equations is also the point of departure in [17]. 
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Remark 2.6. One of the first occurrence of the definition of that subspace M' seems 
to be in [18, Lemma 2.1]. It is used to study systems which are regular pencils (see 
Definition 3.19). 

Another explicit definition is to be found in [11, §7], although with a different 
purpose than ours, namely the study of linear, time-varying differential algebraic 
equations of index one. 

2.3. System Reduction. The subspaces and defined in Definition 2.3 

allow for defining a new system. This procedure will be called "reduction" . 

Proposition 2.7. Given a system (E,A), the operators E' and A' are uniquely 
defined by the following commuting diagram. 

E,A 

M > V 



E',A' 

M' > V 

The vertical arrows are canonical injection from a subspace into the ambient space. 

The operators E' and A! build up a new system (E, A)' which we call the reduced 
system, and is defined by 

(E,A)':=(E',A'). 
Proof. The proof rests on the observation that 



E%a)^ ( e,a) and AM ( ' E >A) C V ( ' E>A) . 



□ 



Remark 2.8. Consider the category which objects are vector spaces and arrows are 
systems as defined in Definition 2.1. The reduction operation, denoted by a prime, 
is an endofunctor in this category, i.e., a functor from that category to itself. 

As we mentioned in the beginning of subsection 2.2, our goal is to obtain a 
reduced system such that E is surjective. It is only part of a general strategy to 
obtain a reduced system where E is invertiblc. It is therefore important that the 
reduction algorithm does not alter the injectivity of E. We observe that this is 
indeed the case. 

Proposition 2.9. //, in a system (E, A), E is injective, then E' is also injective. 
Proof. It is a consequence of the observation that 

ker E' C ker E. 

□ 

The pendant of that observation is the equally simple observation regarding the 
kernel of the operator A with respect to the reduced space M': 

Proposition 2.10. Given a system (E, A) 7 the null-space of A is included in M'^ E 
i.e., 

ker A C M ( ' E A) . 



6 



OLIVIER VERDIER 



2.4. Iterated Reduction. We may iterate the reduction process described in 
subsection 2.3 on the new system (E, A)'. This leads to a sequence of systems 
{(E, A)( fe )}fc £ N which is defined recursively as follows. 

Definition 2.11. The iterated of the reduction of a system (E,A) are defined 
recursively by 

( E (fc+i) )A (fc+i)) := ( E «,A«)', Vfc > 0, 

and 

(E(°),A<°>) := (E, A). 
We will make use of the straightforward notation, for k € N. 

M ( ( E fc) A) :=M (E>A)(fc) , 

V (EA) : = ^(E,A)(*) ■ 

The reduced operators E^) and A^ are essentially restrictions of the original op- 
erators E and A, so we may rewrite the definition of the iterated reduced subspaces 
AfW and V^ k \ 

Proposition 2.12. For a system (E, A) the following assertions hold for any integer 
k > 0: 

Vx e M {k) E (k) x = Ex A {k) x = Ax, 
v (k+i) = EM (fe) ; 

M( fe+1 ) = {x e : A S £^ +1 »}. 

Proof. The proof is a simple verification by induction on fc. □ 

2.5. Totally Reduced Systems. As we shall notice in subsection 2.7, the re- 
peated operation of reduction transforms a system into one which cannot be re- 
duced anymore, or rather, for which the reduction does not create a new system. 
We call such systems "totally reduced" : 

Definition 2.13. We will say that a system (E, A) is totally reduced if 

(E,A)' = (E,A). 

A practical characterisation of a totally reduced system is that V = V. The 
verification is straightforward. 

Proposition 2.14. A system (E, A) is totally reduced if and only if 

V (E,A) = V (E-A)- 

2.6. Almost Reduced System. 

Definition 2.15. We will say that a system (E, A) is almost reduced if 

M[e,a) =M (EiA) . 
The chosen vocabulary is supported by the following facts: 

• a totally reduced system is also almost reduced, which follows from Defini- 
tion 2.3. 

• a system which is almost reduced will be totally reduced at the next step 
of the reduction, since by Proposition 2.12: V" = EM' = EM = V . 
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Remark 2.16. In the situation of a reduced system which is almost but not totally 
reduced, the following subspace sequences 

M (n+l) =M (n) C ...CM" CM' CM 
y(n+2) _ y{n+\) c y(n) <_ _ _ _ c y» <_ yl <_ y 

would be produced. 

A concrete example where this happens is when A = and E is not surjective. 
It is clear that M' — M but V C V. The corresponding system is thus almost 
reduced but not totally reduced. 

2.7. Index. The reduction procedure produces decreasing sequences of subspaces. 
When both sequences stall, the system is totally reduced. The number of reduction 
steps needed to transform a system into a totally reduced one is called the index of 
the system (E, A): 

Definition 2.17. The smallest integer n e N for which the system (E^ n \ A^™') is 
totally reduced is called the index of the system (E, A). 

We will use the following notation for the index of the system (E, A): 

ind (EiA) :=min{ne N: (E,A)( n+1 > = (E,A)( n '}. 

Remark 2.18. The index is always a finite integer 1 , and the reduced system (E', A') 
has an index dropped by one, i.e, 

ind (E ,A)' = ind (E ,A) -1- 

Those observations will be used repeatedly to prove statements by induction on 
the index (e.g., in Proposition 3.20, Theorem 6.1 and Theorem 8.2). 

Remark 2.19. Using Proposition 2.14 we observe that 

ind (EiA ) = min{n e N : V (n+1) = V (n) ). 

Remark 2.20. The index defined in Definition 2.17 is closely related to the geometric 
index defined in [10], [9] or [14, §5.1]. In fact, the geometric index would be the 
first integer n such that the system (E,A)(") is almost reduced (Definition 2.15). 
As we shall see in Corollary 3.17 and Proposition 3.20, this minor difference is only 
relevant for singular pencils. 

2.8. Totally Reduced System. 

Definition 2.21. For a system (E, A) of index n = ind( EjA ) we define the totally 
reduced system as 

(E (oo) ,A (oo) ) := (E (n) ,A (ll) ). 

Remark 2.22. We could simply have defined, say E^°°^ by the limit of the sequence of 
operators E^ (because this sequence eventually stalls), which explains the notation 

"oo". 

We pointed out in subsection 2.2 that the idea behind the reduction procedure 
was to lead to a system where E is surjective. The reduction algorithm indeed 
achieves this goal: 

Proposition 2.23. The totally reduced operator E^ 00 ) is surjective. 



as opposed to the differentiation index, which is infinite in the non-regular pencil case; see, 
e.g., [3, § VII.l]. 
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Proof. The system (E(°°\ A(°°>) is totally reduced so we may use Proposition 2.14 
to conclude that E^M^ 00 ) = (0°°))' = V^°°\ so E(°°) is surjective. □ 

3. Defects 

3.1. Quotient Operators. At each step of the reduction some information is lost, 
by passing from the original system to the reduced one. We capture that informa- 
tion loss by two quotient operators defined on the quotient space M/M' . 

Proposition 3.1. The following commutating diagrams uniquely define the quo- 
tient operators [A] and [E] (the vertical arrows are the natural projections on a 
quotient space). 

M > V M E > V 



* A * + b * 

M/M' -^U V/V M/M' -^-> V' IV" 

Moreover, [A] is infective and [E] is surjective. 

Proof. The quotient operators [A] and [E] are well defined because AM' C V and 
EM' C V" (since in fact, EM' = V" by definition). [E] is surjective because E is 
surjective onto V by definition of V . [A] is injective since, by definition of M', 

Ax e V => x e M'. 

□ 

3.2. Constraint and Observation Defects. Since [A] is injective and [E] is sur- 
jective, the information stemming from those operators are to be collected in the 
cokernel of [A] and the kernel of [E]. The dimension of those subspaces are impor- 
tant invariants of the system (E, A) which we now precisely define. 

Definition 3.2. Let [E] and [A] be defined as in Proposition 3.1. We measure the 
lack of surjectivity of [A] by the first observation defect f3^(E, A), defined as 

(4) /3+(E,A) := dimcoker[A] 

and the lack of injectivity of [E] by the first constraint defect ai(E, A), defined 

as 

(5) ai(E,A) :=dimker[E]. 

Now we take advantage of the reduction procedure and define those defects 
recursively: 

Definition 3.3. The constraint defects afe(E, A) of a system (E, A) are defined 
for any integer k > 1 by 

a fc (E,A) :=ai((E, A)^ 1 )). 
Similarly, the observation defects /3^(E, A) are defined for any integer k > 1 by 

/3+(E,A) :=#((E, A)^ 1 )). 
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3.3. Control Defects. There is another important kind of defect that will be 
needed. In is obtained by considering the dual of the totally reduced system ob- 
tained after repeated reductions. That totally reduced system (E(°°), A^ 00 )) is such 
that E(°°) is surjective, so E^ 00 )* is injective. So what happens for a system (E, A) 
such that E is injective? It turns out that such a system has no constraint defects. 

Proposition 3.4. Given a system (E,A), if E is injective, then the system has no 
constraint defects, i.e., for all integer k > I, afe(E, A) = 0. 

Proof. The proof proceeds by induction on the index. 

(1) If the index is zero, then M' = M and V — V, so dimker[E] = 0. 

(2) For a positive index, using Proposition 2.9 we may apply the induction 
hypothesis and deduce that ojfc(E, A) = for k > 2. 

(3) Now if x + M' e kcr[E] then Ex <E EM'. Since E is injective, this means 
that x e M' and thus that ker[E] = 0. We conclude that «i(E, A) = 0. 

□ 

Let us introduce the notion of a dual system. 

Notation 3.5. Given a system (E,A) we define the dual system (E,A)* by the 
pair of adjoint operators (E*,A*), i.e., 

(E, A)* := (E*,A*). 

Proposition 3.6. The dual (E^°°\ /\(°°)y f a totally reduced system has no con- 
straint defects, i.e., 

a (E(°°>,A(°°>) =0. 

Proof. According to Proposition 2.23, the operator E^ 00 ) is surjective, so E(°°> is 
injective, and we conclude using Proposition 3.4. □ 

This suggests that another set of defects is given by the observation defects of 
the dual of the totally reduced system (E(°°), A(°°)). 

Definition 3.7. Given a system (E, A), we define the control defects (3^ (E, A) 

by 

j 8fc(E,A):= / 8+(E(°°)* ) A( 00 )*) Vfc > 1. 

3.4. Intrinsic Dynamical System. The reduction procedure may thus be used 
once to obtain a totally reduced system, and may then be applied again to the dual 
of that totally reduced system. 

Starting with a system (E, A), we may completely reduce it to obtain the system 
(E(°°), A(°°)). The operator E^°°> is injective. The adjoint system (E^ 00 )*, A^°°>) 
may be in turn completely reduced to obtain the system (E^ 00 )*^ 00 ), A^ 00 ^ 00 )). 
Using Proposition 2.23 and Proposition 2.9, we obtain the following result. 

Proposition 3.8. The operator E^ 00 '^ 00 ) j s invertible. 

Since the operator E^ 00 '*^ 00 ' is invertible, its domain and co-domain have the 
same dimension. This dimension is the dimension of the intrinsic dynamics of the 
system. 

Definition 3.9. The dynamical dimension S of the system (E, A) is defined by 
the integer 

5 := dimM^ 00 ^ 00 ) = dimV^*^ . 
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Remark 3.10. For a differential equation defined by the system (E, A), the system 

^£(oo)*(oo)* y^(oo)*(oo)*^ 

corresponds to the underlying differential equation. In particular, the dynamical 
dimension 5 determines the degrees of freedom for the choice of the initial condition. 

3.5. Dimensions of the Subspaces. In order to study the relations existing be- 
tween the defects and the various subspaces and V^ k \ we define the following 
spaces, which measure the difference of dimension between each successive reduc- 
tion: 

Definition 3.11. Recalling Definition 2.11, for any integer k > 1 we define the 
spaces 

AM {k) := M {k ~ 1] /M {k) 

and 

Ay(k) ._ y(fc-i) 

By definition of the defects in Definition 3.2 and using Proposition 3.1, one 
obtains the relations 

dim AM (fe) = dim AV (k+1) + a k , Vk > 1, 
(6) ~ 
dimAy (fe) = dimAM (fc) + (3+ , Vfc > 1, 

between the dimensions of the spaces defined in Definition 3.11 and the defects. 
For any integer k > 1 this implies the inequalities 

• • • < dim AM (H1) < dim AV {k+1) < dimAM (fe - 1} < dim AV [k) < ■ ■ ■ . 

Remark 3.12. This is the same sequence of inequalities as in [17, 5.2]. 

In particular, the dimensions of the spaces AM^ and AV^ may be expressed 
using the constraint and observation defects. 

Lemma 3.13. For any integer k > 1, the dimensions of the spaces AM^ and 
AV^ are related to the defects by the identities 

dimAF (fe) = ^(a j +/?/), 
j>k 

dimAM^ =^2( aj +(3f +1 ). 

j>k 

Proof. Those identities follow from an induction based on (6) and the observation 
that the integers dimAT^) and dimAM( fc ) are zero when k is bigger than the 
index of the system. □ 

Remark 3.14. As we shall see in Theorem 5.1, the quantity defined in [7] as the 
"strangeness" s turns out to be the integer 

s^dimAV". 

Roughly speaking it expresses the number of constraints that, when differentiated, 
will help to reduce the system. 

We may thus give the precise relation of the strangeness to the defects using 
Lemma 3.13, namely 



dim AF" = J2Pk +J2 ak - 



k=2 k=2 
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The dimensions of the spaces M and V may also be expressed from the defects 
and the dynamical dimension 6 (see Definition 3.9). 

Proposition 3.15. The dimensions of M, V, the defects a, (3 + and f3~ and the 

dynamical dimension 6 are related by the formulae 



dimM = 8 + J2ka k + J2k/3 k + ^ fc/3+ +1 , 

k>l k>l k>l 

dimV = 5 + Y / ka k + Y / + £ k[3^ +v 

k>l k>l k>l 

Proof. First observe that since dimU^ = dimV r (' £ + 1 )+dimA^( fc+1 ) and dim = 
dimM( fe+1 ) +dimAM( fe+1 ), we have 

dim M = dim M (oo) + ^ dim AM (fc) dim V = dim U (oo) + ^ dim AV (k) . 

k>l k>l 

Using Lemma 3.13 we obtain 

dim V = dim V (oo) + ^ ka k + ^ k/3+, 
fe>i fc>i 

and 

dimM = dim M* 00 * + ^ fcafc + H fc #fe+r 
fe>i fe>i 

Now using the observation of Proposition 3.6 that a(E(°°)*, A^ 00 )*) = 0, along with 
Definition 3.7 of the defects [3~ and Definition 3.9 of the dynamical dimension 5 we 
readily obtain the result. □ 

3.6. Relation with the Index. The index is, as expected, a non-dynamical in- 
variant. More precisely, it is a function of the defects, as the following proposition 
shows: 

Proposition 3.16. The index ind(E,A) ( see Definition 2.17) of a linear system 
(E, A) is given by 

ind (EiA) = min{n e N : Vfc > n a fe (E,A) = and /3+(E,A) = 0}. 
Proof. Following Remark 2.19, the index fulfills 

ind( E , A ) = mm dim AU (A;+1) = 0. 

Using Lemma 3.13 we thus obtain 

dim AU (fe) = ^ otj + 0+ = Vj > k + 1, 
which proves the claim. □ 

In the case of a system without observation defects we obtain readily: 

Corollary 3.17. The index of a system (E, A) without observation defects (i.e., 
f3 + = 0) is the biggest index of non-zero constraint defects, i.e., 

ind(E,A) = min{n G N : Vfc > n Qfe(E, A) = 0}. 

backgroundcolor=grccn!40]add remark on index for DAEs? 
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3.7. Defects and Invertibility. The choice of the name "defect" may seem overly 
negative, but those integers really measure how far this system is from a system 
where E is invertible. This is the essence of the following proposition. 

Proposition 3.18. For a given system (E, A) the following statements are equiva- 
lent. 

(i) All the defects a, (3 + and /3~ are zero. 

(ii) The operator E is invertible. 

Proof. E is surjective if and only if AV' = 0. By Lemma 3.13, that is equivalent 
to a = [3 + = 0. Since E is invertible if and only if both E and E* are surjective, we 
obtain the result using Definition 3.7. □ 

3.8. Regular Pencils. A pencil is a polynomial on a ring of matrices. Since we are 
interested in pairs of matrices, our attention is restricted to first order polynomials, 
and to the property of such a polynomial to be regular. 

Definition 3.19. The system (E, A) is a regular pencil if there exists A € C such 
that AE + A is invertible. 

There is a remarkable relation between the property of being regular and the 
defects: 

Proposition 3.20. The system (E, A) is a regular pencil if and only if all the 
defects (3 + and (3~ are zero. 

We need first a lemma to understand how the pencil regularity property may be 
lost during the reduction. 

Lemma 3.21. The system (E, A) is a regular pencil if and only if both the following 
properties hold: 

(i) /3+(E,A)=0 

(ii) The reduced system (E, A)' is a regular pencil 

Proof. (1) Consider, for any A € C, the operator Sa defined by 

S A := AE + A. 

Sa can be decomposed into S' x and [Sa] according to the following commut- 
ing diagram: 



> M' * M > M/M' > 

Sa Sa [S a ] 
► V »• V ► V/V > 



Since both rows are exact sequence, out of the three operators S^, Sa 
and [Sa], if two of them are invertible then the third one is. One easy way 
to prove this fact 2 is by choosing bases in M and V which are compatible 
with the subspaces M' and V . The operator Sa is then represented by a 
block triangular matrix where the diagonal blocks arc the matrices of S A 
and [Sa]- Now it is easy to check that if two of those three matrices are 
invertible, the third one is. 



2 



This is a very general result that holds in other contexts as well, since one may also prove it 
by diagram chasing. 
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(2) Notice that for any A € C, [Sa] = [A] (the operator [A] is defined in Propo- 
sition 3.1), so [Sa] is invertible if and only if + =0. As a result, we obtain 
the property 

j3+ = => [VA e C S A invertible S' A invertible] . 

(3) For any A e C, the surjectivity of Sa implies that of [Sa]- Since [Sa] does 
not depend on A, it means that if [Sa] = [A] is not surjective, then Sa is 
not surjective for any A € C. Now since, by definition, if j3 + ^ then [A] 
is not surjective, we conclude that 

[3 + ^ => VA e C Sa not invertible. 

All the possibilities are covered and the claim is proved. □ 

Proof of Proposition 3.20. (1) We first show by induction on the index that 
(E, A) is a regular pencil if and only if f3 + = and (E, A)(°°) is a regular 
pencil. It is easy to show using Lemma 3.21. 

(2) Now a system (E, A) is a regular pencil if and only if the dual system (E, A)* 
is a regular pencil, so we may apply on (E^°°'*, A^ 00 )*) the claim just proved. 
Because of Definition 3.7, we obtain that (E, A) is a regular pencil if and 
only if (3 + and (3~ are zero, and (E^M 00 ), A^ 00 ^ 00 )) is a regular pencil 

(3) Since, by Proposition 3.8, E(°°)*(°°) is invertible, the system (E(°°)*(°°), A(°°M ( 
is a regular pencil, and the claim is proved. 

□ 

4. Coupling 

4.1. Motivation: coupling spaces. In Section 2 we showed how to define invari- 
ant subspaces for the system (E,A). "Invariant" means here that those subspaces 
arc not arbitrarily chosen, they depend in a unique way from the system at hand. 

In order to obtain a simple matrix representation of that system, we will need to 
choose supplementary spaces to the invariant subspaces M' and V . In this section, 
we focus on such supplementary spaces for one reduction step only, and establish 
some results which will be needed in Section 6. 

We first look at the case of supplementary subspaces to the subspace M' , i.e., 
subspaces N' C M such that 

M = M' 8 N'. 

The strategy is to try and choose N' in the same direction as the part of ker E 
that remains out of M' . First we define what this space is by decomposing the 
kernel of E in the part that is included in M' and some supplementary space. This 
is achieved by choosing any supplementary space K' such that 

kerE = (kerEnikf')®i<". 

Then since, by construction, K' n M 1 = one may complete M' by choosing a 
supplementary space C such that 

M = M'@C'®K'. 

We now define N' as 

N' := C 8 K'. 

The choice of C will prove to be essential to obtain a complete decomposition 
of the system (E, A). The tool to choose C appropriately will be Lemma 4.2. 
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But notice now that no matter how we choose C", the space K' roughly speaking 
corresponds to the variables that are decoupled from the rest of the system. They 
are sometimes called the algebraic constraints. 

Example 4.1. Let us illustrate the previous remark by a trivial example. Consider 
the simple system 

x' = x 
y = 0. 

The variable y is decoupled from the rest of the system. 

4.2. Coupling Lemma for E. We will assume that some coupling space W" has 
already been chosen in the reduced system, and that will serve as a starting point 
for the choice of the coupling space at the present stage. More precisely, we assume 
that the reduced space V is already decomposed as 

V' = EM = EM' W". 

That decomposition allows to construct the coupling spaces in an optimal manner, 
in one subspace C coupled with W", and complement with vectors in the null-space 
of E. 

We state the result in a lemma, formulated outside the context of linear systems. 

Lemma 4.2. Assume that an operator E acting on a space M, and consider a 
subspace M' C M. For any subspace W" such that EM — EM' ® W" there exists 
subspaces K' and C such that 

M = M' @C @K' 

and such that the sequence 

> K' > K' 8 C — W" * 

is exact. The exactness means here that ker E n (K' C) = K' and E(K' ® C) — 
W". 

Moreover, for any choice of basis in W" one may choose a basis of C such that 
its image by E is the basis in W" (see Figure 1). 

Proof. (1) Consider 

C 7 := E^W" = {xeM: Ex £ W"). 

Observe that M' + C 7 = M, and EC 7 = W". 

(2) Pick x € UnM'. It implies that Ex € W"f]EM', so Ex = 0, i.e., x e ker E. 
We conclude that C 7 n M 1 C ker E. 

(3) Choose C such that C = ker EffiC It follows from the previous observa- 
tion that C'nl' = 0. This implies 

M = (M' + kerE)®C. 

(4) Decompose further ker E as 

kerE = (kcrEnM')©^". 
As a consequence, we obtain 

M' + ker E = M' ® K' 
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M 



M' 


a 


K> 





EM' 



EM 



W" 



Figure 1. An illustration of Lemma 4.2. There is a basis choice 
such that the operator E is represented as this matrix. Blue squares 
are identity matrix blocks. Other areas are filled with zeros. 



and K' C ker E. 

(5) Finally we have EC" = EC 7 = W", and C n ker E = 0. Moreover, since E 
restricted on C sends C bijectively to W, the inverse image of the basis 
of W" is a basis of C . 

□ 

4.3. Complementary subspaces. The definition of M' , in Definition 2.3, is the 
set of vectors in M such that Ax intersects with the image of E. We are now 
interested in the converse statement, namely, that if we choose a subspace N' 
which does not intersect M' , its image AN' by A should not intersect the image of 
E. This is the gist of the following lemma. 

Lemma 4.3. Consider a linear system (E, A) and assume that N' is a subspace of 
-^(E,A) that does not intersect M'^^, i.e. such that 

N' n M ( ' EiA) = 0. 

Then the property 

a^v' n v( EA) = o 

holds. 

Note that this result is a consequence of Proposition 3.1. Indeed, the subspace N' 
may be injected in M/M' and the result follows from the fact that [A] is injective. 
We give also a direct proof of this elementary lemma. 

Proof. For general subspaces N' C M and W C V, we have 

AN 1 n W = A(N' n A^W'). 
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With W = V and since by Definition 2.3, M' = A" 1 !/', we obtain 

AN' D V = A(N' n M') 
from which the claim follows. □ 

4.4. Coupling Lemma for Systems. We may now combine Lemma 4.2 and 
Lemma 4.3 and obtain a fundamental Lemma that decouples the operators E and 
A on supplementary spaces to M' and V . 

Lemma 4.4. Suppose that there is a decomposition 

V = V" © W" , 

and that W" is equipped with a basis. 
Then there exists decompositions 

M = W N', 

v = v e W, 

and subspaces 

C CM D' C V, 

K' CM Z' C V, 

such that 

(7) n' = C e x', 

(8) W = D' e z'. 

Those subspaces are such that the following sequences are exact: 

E 



and such that 



N' D' © Z' > Z' 

w 



AM n Z' = 0. 

Moreover, one may choose basis in the subspaces C , K' , D' and Z' such that 
the basis of D' is the image by A of the basis of N' , and the basis on W" is the 
image by E of the basis of C . 

Proof. (1) By the assumption on W", we have 

EM = V' = V" © W" = EM' © W". 

The subspace W" is moreover equipped with a basis by the induction hy- 
pothesis. 

(2) Appealing to Lemma 4.2 we obtain subspaces C and K' such that 

M = M' © C © K' 

with 

EC" = W" 

and 

EK' = 
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and 

kcr E n C = 0. 

Note that, given a basis in W" we can choose a basis on C such that E 
sends that basis on that of W". 

Let us now define the subspace N' C M by 

N' := C 8 K'. 

We choose an arbitrary basis of the space K' , and this provides us with a 
basis for the space N'. 

(3) Recall now that, according to Lemma 4.3, 

AN' n EM = 0, 

and by Proposition 2.10, the operator A sends the basis of the space N' to 
a set of independent vectors in the space V. We thus choose as a basis of 
AN' the image of the basis of N' by A. 

(4) Now choose a subspace Z' C V such that 

V = EM AN' e 
and pick an arbitrary basis of that subspace. We define 

D' := AN' 

and 

W :=D'®Z'. 

□ 

Remark 4.5. The dimensions of the spaces introduced in Theorem 6.1 are related 
to the dimensions of the spaces introduced in Definition 3.11, and to the defects 
(Definition 3.2). The relations are given by 

dim W' = dim AV' dim Z' = ai , 

dim N' = dim AM' dim K' = 0? . 

5. Strangeness 

In order to illustrate the power of reduction, and to show an application of 
Lemma 4.4, we show an intermediate result. Instead of looking at the equivalence 
classes for the equivalence of matrices, that is, pairs of invertible operators acting 
on (E, A) as (PEQ, PAQ), we look at the weak equivalence. 

Weak equivalence is determined by another group, which elements consist of two 
invertible operators P and Q and an arbitrary operator R, acting on a system (E, A) 
as 



(9) 



(P,Q,R)-(E,A) := (PEQ, P(ER + AQ)). 
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5.1. Weak equivalence group. The group operation corresponding to weak equiv- 
alence is given by 

(P 2 ,Q 2 ,i? 2 )- (Pi,Qi,i?i) = (P 2 Pi,QiQ2,0ii?2 + ^iQ 2 ), 

where Pi, P2 are automorphisms of V, Qi, Q2 are automorphisms on M, and Ri, 
R2 are arbitrary endomorphisms on M. 
The identity is then 

(i,i,o), 

and the inverse of an element (P, Q, R) is given by 

(P, Q, R)- 1 = (P- 1 , Q- \ -Q-^Q- 1 ). 

Clearly, the elements of the form (P, Q, 0) form a subgroup corresponding to the 
equivalence relation. Another subgroup is given by elements of the form (1,1, R). 
For the study of the orbits of the weak equivalence group, the identity 

(10) (P, Q, R) = (P, Q, 0) • (I, I, RQ- 1 ) = (I, I, Q~ l R) ■ (P, Q, 0) 

shows that we may restrict our attention to one subgroup at a time. 

5.1.1. Orbit Invariants. The orbits of the weak equivalence group action (9) were 
studied in [7], in which the authors exhibited a complete set of invariants. We give 
an alternative proof here, thereby shedding some light on the notion of strangeness. 

Theorem 5.1. A complete set of invariants for the group action (9) is given by 

(1) d:= dimV", 

(2) a := dimker[E] = a l7 

(3) s := dimAU". 

The integer s is called "strangeness" in [7]. 

Proof. (1) First we have to check that the three integers are indeed invariants 
of the group action. Clearly, they are invariants by transformations of the 
form (P, Q, 0), which are merely equivalent transformation. 
Let us examine the case of a transformation 

(E,A) = (E,I,R)-(E,A) = (E,Ei2 + A). 

We have 

V' = EM = EM = V', 
m' := {x : Axe EM} = M', 

so 

(E',A') = (E,A) 

and 

V" = I'M' = EM' = V". 

Using (10), this shows that the spaces V, V" and M' are invariants of all 
of the weak equivalence group transformations. 

As a result, the spaces AV" and the operator [E] are also invariants, so 
the integers d, a and s are invariants. 
(2) Now we show that the integers d, a and s are the only invariants. In order to 
show that, we show that a system (E, A) is weakly equivalent to a canonical 
form that depends only on those three integers. 

In order to achieve this, we decompose M and V using Lemma 4.4. 
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M' C K' 



V" 



D' 



Z' 



Figure 2 . Canonical form of a matrix corresponding to the weak 
equivalence. The matrix E is represented in blue, whereas the 
matrix for A is represented in green. All such squares are identity 
matrices. The rest is filled out by zero entries. 



(a) Let us choose an arbitrary decomposition 

V = V" 8 W". 

We may now apply Lemma 4.4 to obtain spaces C", K' , D' and Z' 
equipped with appropriate bases. Using Remark 4.5 we obtain s = 
dimW" and a — dim if'. 

(b) Finally, define II as a projector from M to M' along N' . Let F be a 
right inverse for E on V' = EM. 

Define 

R := -FAII, 

so ER + A = on M'. 

As a result, if we define the new system (E, A) by 
(E,A) := (E, ER + A), 

then the restriction of A on M' is zero. 

(c) Now we may choose a basis of M' and of V" = EM' such that E is 
represented by the identity matrix on M' . 

This provides us with complete basis of M and V such that the ma- 
trices E and A take the form described in Figure 2. 

□ 

6. Direct Decomposition 

6.1. Decomposition Theorem. In Section 2 we showed how to define invariant 
subspaces and for the system (E,A). In order to obtain a complete 

decomposition of the spaces M and V, it is necessary to construct subspaces that 



d 










s 




s 


- 

a 
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bridge the gap between each invariant subspaces and . More precisely we 
construct spaces an d such that 

M (k) = M (k+i) Ar (fe+i) ! y(k) = y(k+i) w (fe+i)_ 

In a sense, the subspaces and correspond to the spaces AMW and AV( fe ) 
respectively, defined in Definition 3.11. 

The construction of those supplementary spaces proceeds backwards, in the di- 
rection opposite to the reduction. One must first totally reduce the system (E, A). 
Assume that the index is n. One then chooses an arbitrary complementary space 
W [n) such that V^"" 1 ) = V [n) ® W [n) , and equip that space with an arbitrary 
basis. The rest is a repeated application of Lemma 4.4. 

We now see how to choose those supplementary spaces and so that 

the operators E and A are simultaneously decomposed in an advantageous way. 

Theorem 6.1. Consider a system (E, A). 

Recall the definitions of the subspaces and in (3). 

For any integer k € N there exists subspaces A^ fe+1 ) C M and W^- k+1 ^ C V such 
that 



M {k) = M (fc+1) e M k+1 \ 

y{k) = y(k+l) W (k+l)^ 

and for any integer k > 1 there exists subspaces 

C {k) C M c V, 

K {k) C M Z {k) C V, 

such that 

(11) = C (k) ®K {k \ 

(12) W (k) = D {k) e Z {k) . 

Those subspaces are such that for any integer k > 1, the following sequences are 
exact (see Figure 3): 

* > K {k) e c {k) -^-» w( k+ v > 

S v ' 

, AT(fc) D (k) e z w * z« > 



WW 

and such that 



AM c=-i) n ^(fe) = o. 

Moreover, one may choose basis in the spaces C^ k \ K^ k \ and such 

that the basis of is the image by A of the basis of N^ k \ and the basis on W^ k+1 ^ 
is the image by E of the basis of . 
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M<~' A"" N" A" 

C"'K"' C" K" C" A"' 

A = I, E = 



E = I, A = 



E = I, A =? 



E = 0,A=? 



E = 0, A = 



E = 0, A = 



Figure 3. An illustration of the decomposition described in The- 
orem 6.1. Noticing that no matter what bases we choose in Af(°°) 
and V^ 00 -* the matrix is block diagonal (see Corollary 6.4), we may 
choose those bases in such a way that E is represented as the iden- 
tity matrix on that block. Since we now that E^ 00 ) is surjective 
(Proposition 2.23), that identity block stretches to fill 



Remark 6.2. For the reader averse to the language of exact sequences, the fact that 
the sequences of Theorem 6.1 are exact means in that case that 

EC {k) = W {k+1 \ 

£K {k) = 0, 

kN {k) = D {k \ 

kcr A n N {k) = 0. 

Remark 6.3. In the same spirit as Remark 4.5, we notice the relation between the 
dimensions of the various subspaces introduced in Theorem 6.1, and the dimensions 
of the spaces defined in Definition 3.11, and to the defects (Definition 3.2). For any 
integer k > 1, the relations are given by 

dim W (k) = dim AV (k) dim Z {k) = a k , 

dimiV (fe) = dimAM (fc) dimif (fc) = /3+. 

Proof of Theorem 6.1. We proceed by induction on the index (see Figure 4). If the 
index is zero, all the spaces AfW and 

are zero, and there is nothing to prove. 
Assume now that the statement holds for systems of index n — 1. Given a system 
(E, A) of index n, the reduced system (E', A') has index n — 1, so we may apply the 
induction hypothesis on that reduced system. 
For clarity, let us denote 

(E,A) := (E',A'), M = M', V = V . 
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M' 



N' 



M(-) N'" N" C K' 



V 



w 



y(°°) 

W"'[ 
W" 

D' 
■ Z 1 



A = I, E = 
E = I, A = 
E = I, A =? 
E = 0,A=? 
E = 0, A = 
E = 0, A = 



Figure 4. An illustration of Theorem 6.1 and Lemma 4.4 on an 
index three system. The grey shaded part pictures the previous 
step of the recursion. Starting with W" , one constructs the space 
C such that EC" = W" , and a subspace K' such that K' C ker E 
using Lemma 4.2, and defines N' := C ®K' . One then constructs 
Z' such that V = V AN' Z'. This in turn defines W := 
AN 1 8 Z'. 



The reduced system (E,A)' consists of operators operating from M' to V , so by 
the induction hypothesis we obtain a decomposition of the spaces M and V into 

subspaces , N as described in the statement of the theorem. 

We have to shift the indices of all the spaces produced for the final statement to 
hold. For example, we define for any integer k > 2 

;= 

so we may write the decomposition of V' as 

V = V = V {oo) ffi W {n) © VF ( ™- 1} © • • • © W". 

The reduced operators E' and A' being restrictions of E and A, the statements 
obtained from the induction hypothesis apply to the operators E and A. 
Applying Lemma 4.4 yields the desired result. 

□ 

6.2. Decomposition in invariant subspaces. A crucial consequence of Theo- 
rem 6.1 is that it provides us with decompositions of M and V such that E and A 
may be restricted on those subspaces: 

Corollary 6.4. Given the decomposition provided by Theorem 6.1, and defining 
M and V by 
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then, by construction, 



M = M (oo) M, 



V = V {oo) @ V, 



and we have 



(i) 



em (oo) = y (oo) 



AM (oo) c V {oo) 



(ii) 



EM c V 



AM c V 



Proof. The fact that EM C V and AM C V follows from 

EAT( fc ) c W( k+1) C F 



and 



AiV (fe) c c V. 



□ 



7. Dual Decomposition 

7.1. Dual space decomposition. Assume that a finite dimensional vector space 
M is decomposed in a direct sum of subspaces, i.e., 

m = Mi e m 2 e • • • e m„. 

This decomposition induces the dual space decomposition 



Although it is not reflected by the notation, it is clear that (M k )* actually 
depends not only on M& but on all the other spaces of the decomposition. It is a 
generalization of the notion of dual basis. 

Assume further that M is equipped with a basis. We say that this basis is 
compatible with the decomposition if each subspace is the span of a subset of the 
basis. 

If M is equipped with a basis B compatible with a subspace decomposition, then 
the dual basis is compatible with the dual space decomposition. 

Indeed, consider a subspace M& of the decomposition. Since the basis is com- 
patible with the decomposition, that subspace is spanned by a subset of the basis 
B, say S Mk C B, i.e., 



The dual decomposition is such that the associated subspace (M^)* is the span of 
the dual basis with the same subset S]\j k , i.e., 



Here the covector e* is the element of the dual basis of B corresponding to e, i.e., 
such that 



(M fc ). := (0M J ) ± = { 



<pEM* : (<p, x) = Vx e Mj 



M fe = spanS Mfc . 



(M fe )* = spanje* : e <G S Mk }- 
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Lemma 7.1. Assume that A and B are subspaces of M that are part of a subspace 
decomposition of M, and that C is a subspace ofV that is part of a subspace decom- 
position of V . Assume further that M and V are equipped with bases compatible 
with their decompositions. Take an operator S operating from M to V . 
The following two statements are equivalent: 

(i) The following sequence is exact: 

* A * A®B — C > 

Moreover, the operator S sends the basis of B on the basis of C. 

(ii) The following "dual" sequence is exact: 

* C, — A* © B, > A, > 

and 

S*V* n A* = 0. 

Moreover, the operator S* sends the basis of C* on the basis of ' B*. 

Proof. Denote the basis on M and V by B(M) and B(V) respectively. Clearly, for 
any e € B(M) and / e B(V), we have 

(/*,Se) = <S*/*,e> = <(e*)*,S7*) 

where (e*)* is the dual basis of the dual basis of B. 

The proof is now a simple verification by expressing each of the statements in 
terms of the bases. For example, SA = may be written as 

(/*,Se>=0 VeeS A f e B(V), 

so one obtains 

((e*)*,S*/*>=0 VeeS A f € B(V), 

which means that SA = S*V* fl i, = 0. The other statements are verified 

in the same fashion. □ 

7.2. Conjugate Decomposition. Consider a finite dimensional linear system (E, A) 
and its dual (E*, A*). The corresponding domain and codomain are denoted by 

M := M (EiA) . = V*, 

V := V (E ,A). = M*. 

By applying Theorem 6.1 on the dual system (E*, A*) one obtains a decomposi- 
tion of M and V as 

M = M (oo) © K' © C' © • • • , 

v - F (oo) © z' © D' © • • • . 

Moreover, all those subspaces are equipped with a suitable basis. By choosing a 
basis for the spaces M (oo) and F (oo) we obtain compatible bases B(M) and B(V) 
of M and V respectively. 

Theorem 7.2. Consider the decompositions produced by Theorem 6.1 for the dual 
system (E, A)*. The dual decompositions induce decompositions of the spaces M 
and V by the canonical isomorphism between a space and its bidual. 
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For any integer k <G N we have 

Mi k) =Mi k+1) ®Ni k+1) 

K (fc Wj fc+1 W (fc+1 \ 

Those subspaces are such that the following sequences are exact (see Figure 3): 

o — Ki k) e d fe) — K {k) — 

s v ' 

o > z {k) ► Di k) e z< fe) o 



Moreover, the basis of C[ k \ K^ k \ D^'and Z^ are such that the basis of 
is the image by A of the basis of and the basis of is the image by E of 

the basis of W^ k+1 ^ . 

Proof. It is a direct application of Lemma 7.1. □ 

Remark 7.3. The exact sequences of Theorem 6.1 are the same as those of Theo- 
rem 7.2 but with flipped arrows. It just reflects how the block structure of a matrix 
is related to the block structure of the transposed matrix. 

7.3. Second sweep of the decomposition. Remember that the constraint de- 
fects of the adjoint of the totally reduced system (E^ 00 )*, A^°°^*) are zero (Proposi- 
tion 3.6). Along with Remark 6.3, we conclude that the corresponding subspaces 
K(k) produced in Theorem 6.1 are zero. 

Now, using Corollary 6.4, we are in a position to use the decomposition of The- 
orem 6.1 for the dual system (E^ 00 )*, A^ 00 )*) and obtain a decomposition of M^* 
and V(°°)*. 

Theorem 7.4. In addition to the decomposition given by Theorem 6.1, the spaces 
M(°°) and y(°°) may now be decomposed as (see Figure 5): 

o > zw > z& e -A> — > o 

s v ' 

> VK( fc+1 ) ^— ► Z^ > 



Proof. □ 

Remark 7.5. The various defects defined in Definition 3.2 and Definition 3.7 may 
now be pictured clearly using the Theorem 7.4; see Figure 6. 

8. Kronecker Indices 

8.1. Basis Arrangement. In this section we prove a result on the basis obtained 
in Theorem 6.1, which will be useful to determine the relation with the Kronecker 
decomposition theorem. 
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A = I, E = 
E = I, A = 
E = I, A =? 
E = 0,A=? 
E = 0, A = 
E = 0, A = 



Figure 5. An illustration of the full decomposition of Theo- 
rem 7.4. The first decomposition leads to M" and the correspond- 
ing space V" = EM", at which point the algorithm stalls. The 
second step consists in transposing the reduced operators E^ 00 ) and 
/\(oo) (indicated by the bold red frame on the figure), running the 
same algorithm, and transposing back again. The upper left check- 
ered area denotes the identity for E, and a non specific matrix for 
A. Notice that this block is completely separated from the rest, 
so one may now reduce the A to Jordan blocks by a similarity 
transformation. 

Definition 8.1. For k G N, k > 1, we define a N^-sequence to be a sequence 

rrij G M 1 < j < k 
of k independent vectors in M, and a sequence 

Mj G V l<j<k 

of k independent vectors in V such that Arrij = for 1 < j < k, Errij = vj-i for 
2 < j < k, Emi = and v^. ^ Im E, which is summarized in the following diagram. 

n E A E E A, TC 
< m x ► V! < • • • < rrifc »• Vj, f. im E 

Similarly, for k G N, k > 1, we define a L^- sequence to be a sequence 

rrij G M 1 < j < k - 1 
of k — 1 independent vectors and a sequence 

Vj G V 1 < j <k 

of fc independent vectors which fulfill the conditions summarized in the following 
diagram. 

E A E A 

Im A ^ Vi < mi * ■ ■ ■ < rrife_i > Vfc ^ Im E 



JV.M" 

W" 
W" 

w 
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Theorem 8.2. Theorem 6.1 produces bases such that there are a k N ^-sequences, 
and (3^ L k -sequences. Moreover, the end vectors v constitute a basis of W . 

Proof. (1) We proceed by induction on the index. Assume that the result holds 
for the reduced system (E', A'). 

(2) The basis of C is precisely such that Em k+ i = v fe . Besides, the basis of 
AC" is chosen such that v„ + i = Am^i, so each L k and N k sequence is 
extended with two elements, meaning that they build now L k +i and N k +i 
sequences. 

(3) Each element m of the basis of K' produces a new N\ sequence (m,Av), 
since Em = and Am = v ^ Im E: 







v^ImE 



The dimension of K' being ai, we produce ct\ such sequences. 

(4) Each element v of the basis Z' qualifies as a L sequence, since v ^ ImE 
and v ^ Im A, so 

Im A ^ v ^ Im E. 

Since the dimension of Z' is fi^, we produce fi^ such sequences. 

(5) We conclude using Definition 3.3 

□ 

8.2. Kronecker Decomposition. The Kronecker canonical form makes use of 
special blocks, each of which having a variant for the matrices E and A. 

Definition 8.3. The rectangular bidiagonal blocks L|E part of the Kronecker 
L-blocks and A part of the Kronecker L-blocks defined by 



1 

1 



1 




> k, 



L A 





1 



1 

1 



\ k. 



The nilpotent blocks N|E part of the nilpotent blocks and IM^A part of the 
nilpotent blocks defined by 



10 1 




1 

1 



N 



> fc, 



N 



\ k. 



Definition 8.4. A Kronecker decomposition of the system (E, A) is a choice of 
basis of M and V such that E and A are decomposed in blocks of the same size 



I 
E 



J 0' 

A 



28 



OLIVIER VERDIER 



where J is a diagonal block of Jordan blocks, and E and A are in diagonal block 
form 

E = diag(N^ . .., N^, , L E kp , (L^) T , . . . , (L^ ) T ), 

A = diag(Nt . . . , N£ m , Lt, ■ • • , L£ p , (L£j T , . . . , (!_£/), 
where the blocks of E and A have the same size. 

Theorem 8.5. A decomposition with defects a, f3 + , /3~ , produces a Kronecker 
decomposition which for all integer k > 1 contains 

• ctfc block of type N k , 

• /3fc blocks of type L k , 

• /3 k blocks of type \J k . 

Proof. Recall the definition of L k and N k sequences in Definition 8.1. By regrouping 
the elements of an L k sequence, one obtains a representation of E and A as a Lfc- 
block, and similarly, by regrouping the elements of a N k -sequence, one obtains a Nfc 
block. Applying Theorem 8.2, and regrouping the basis elements stemming from 
the sequences L k and N k we obtain a k IM fe -blocks and (3 k L fc -blocks, for k > 1. 

Now the basis on the sub-block M^°°\ V^- 00 ^ are obtained by transposing the 
decomposition given by Theorem 6.1. Using the previous step and Proposition 3.6, 
we obtain j3 k transpose of Lfc-blocks, for k > 1. □ 

Remark 8.6. It is remarkable that the decomposition obtained in Theorem 7.4 
produces basis vectors which are the same as for a Kronecker decomposition, only 
ordered differently. The necessary permutations may be visualised on Figure 6. 

8.3. Conjugate Decomposition. We may now show the relation between the de- 
fects a, f3 + and f3~ of a system (E, A) and the defects of the adjoint system (E*, A*). 
It turns out that the constraint defects are the same and that the observation de- 
fects /3 + and the control defects (3~ are just switched. This fact would have been 
very difficult to prove from the results of Section 3 alone, so we need the full power 
of Theorem 7.4 and of its consequence, Theorem 8.5. 

Theorem 8.7. The conjugate decomposition switches the defects (3 + and (3~ , i.e., 
it produces the defects 

a(E*,A*) = a(E,A), 
/?+(E*,A*)=/T(E,A), 
/T(E*,A*)=/3+(E,A). 

Proof. It is a consequence of Theorem 8.5, for when putting the system in Kronecker 
form Definition 8.4 and transposing, the system is still in Kronecker form, but the 
bidiagonal blocks L and (L) T are switched. □ 

8.4. Weierstrafi decomposition. In the case of regular pencils (see subsection 3.8), 
the Kronecker decomposition is called the Weierstrafl decomposition ([16, 1]) and 
is such that E and A take the matrix representation 



"I 0" 




"C 0" 


N 


A = 


I 



where C may be in Jordan normal form and N is a block diagonal matrix of blocks 
of type Nf. 
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S Pi 03 02 Pi "3 "2 "1 



Pt1 



Pl\ 



Ptt 

Figure 6. An illustration of the defects a, f3 + and j3~ and of the 
Kronecker decomposition described in Theorem 8.5. The difference 
of size of the squares is exactly given by the defects a, j3 + and f3~ . 
The dark squares bearing the number j represent all the nilpotent 
blocks N j ; there are otj such blocks. The light squares in the lower- 
right part bearing the number j represent the L-blocks L, . There 
are 0f such blocks. The light squares in the upper-left part bearing 
the number j represent the L-blocks L^. There are (3~ such blocks. 
This figure also allows to check the formulae of Proposition 3.15. 







3 



4 4 

~ ~3| 




A = 


I. E 


= 


E = 


I, A 


= 


E = 


I, A 


=? 


E = 


0,A 


=? 


E = 


0, A 


= 


E = 


0, A 


= 



The matrix block NMatrix of nilpotent blocks is a diagonal block matrix 

N = diag(Nf i (0),Nf 2 (0),...,Nf m (0)) 

where the blocks (N^(0) are the nilpotent blocks defined in Definition 8.3. 

Corollary 8.8. The Weierstrafi decomposition is such that for any integer k > 1 
it contains blocks N|. 

Proof. It is just a special case of Theorem 8.5 using Proposition 3.20. □ 

9. Conclusions 

We have defined the notion of defects and have related them to existing concepts, 
such as the regular pencil condition, the dimension of the reduced subspaces, or the 
notion of strangeness. We also showed how the defects define a normal form, and 
how that normal form relates to the existing one of Kronecker. 

Note that some results, as Theorem 8.7, would be difficult to prove without using 
the canonical form. Nevertheless, we tried to wring the most out of the invariant 
objects defined in Section 3. 

The advantage of such an approach is that it is extensible to nearby cases such 
as the parameter dependent case, or the infinite dimensional case (see [15]). 
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